Cost - Effective , High - Performance Giga - Scale Checkpoint / Restore

نویسندگان

  • Andreas Moshovos
  • Alexandros Kostopoulos
چکیده

Computer Engineering Group Technical Report Electrical and Computer Engineering Department University of Toronto November 18, 2004 Abstract This work proposes a novel checkpoint store compression method for giga-scale, coarse-grain checkpoint/restore. This mechanism can be useful for debugging, post-mortem analysis and error-recovery. The effectiveness of our compression method lies in exploiting value locality in the memory data and address streams. Previously proposed dictionary-based hardware compressors exploit the same properties however they are expensive and relatively slow. We observe that because program behavior is typically not very random much simpler mechanisms can offer most of the compression benefits. We propose three compressors that exploit value locality using very small direct mapped structures. Our compressors require few resources, can be easily pipelined and can process one full block per processor cycle. We study two uses of our compressors for post-mortem analysis: (1) Using them alone, and (2) using them in-series with a dictionary-based compressor. When used alone their offer relatively competitive compression rates in most cases. We demonstrate that when combined with previously proposed hardware-based compression methods, our compressors improve overall compression rates while significantly reducing on-chip buffer requirements. Specifically, with an on-chip buffer of just 1K bytes a combination of our compressor with a dictionary-based compressor results into an overall performance slowdown of just 1.6% on the average for storing checkpoints to main memory. Moreover, this combination reduces checkpoints to 34% of their original size. The dictionary-based compressor alone even when used with a 64Kbyte on-chip buffer incurs an overall performance slowdown of 3.7% and reduces checkpoints to 38% of their original size. The worst performance slowdown is 4.4% for our compressor and 11% for the dictionary-based compressor alone. Thus with a lot less resources (1Kbyte vs. 64Kyte on-chip buffering) our technique offers better performance and compression. When used alone, our compressor reduces checkpoint storage to 52% of its original size. While not as good as dictionary-compression this reduction is possible with very few resources (a dictionary-based compressor requires millions of transistors while our compressor few thousand). All aforementioned results are for a checkpoint interval of 256 million instructions.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Turbo-ROB: A Low Cost Checkpoint/Restore Accelerator

Modern processors use speculative execution to improve performance. However, speculative execution requires a checkpoint/restore mechanism to repair the machine’s state whenever speculation fails. Existing checkpoint/restore mechanisms do not scale well for processors with relatively large windows (i.e., 128 or more). This work presents Turbo-ROB, a checkpoint/restore recovery accelerator that ...

متن کامل

Three-Dimensional Electro-Thermal Modeling of Thin Film Micro- Refrigerators for Site-Specific Cooling of VLSI ICs

Non-uniform high temperature distribution has become a primary concern in view of VLSI performance and reliability. This problem becomes much severer in the nanometer-scale technologies as the required operating frequency moves into the multi-giga Hertz range. Effective handling of “hot spots” is an important issue. Recently, a cooling technology based on heterostructure integrated thermionic (...

متن کامل

Effective Simulation for the Giga-scale Massively Parallel Supercomputer SR2201

A high performance parallel network simulation environment was developed in the SR2201 project. The SR2201 is one of the highest performance massively parallel supercomputers in the world. The enhanced simulation algorithm achieved a 2.4 times increase in simulation speed compared with conventional simulation methodology. A 98% detection rate for all design errors before physical design contrib...

متن کامل

Nrd-guide Technology for Millimeter-wave Wireless Applications

Starting with a brief review of millimeter wave research and development, basic concept, theory and applications of NRD-guide technology are described. A particular emphasis is placed on short distance propagation of millimeter waves such as indoor HD-TV wireless transmission and Giga-bit video downloader. It is also emphasized that NRD-guide can offer advantages of easy fabrication, low cost a...

متن کامل

Nano and Giga Challenges in Electronics Photonics and Renewable Energy (NGC2011) Moscow-Zelenograd, Russia, September 12-16, 2011

This special issue of Nanoscale Research Letters is a collection of selected papers presented at the Nano and Giga Challenges in Electronics, Photonics and Renewable Energy (NGC2011) conference in Moscow and Zeleno grad which addresses both theoretical and experimental achievements and provide a stimulating outlook for technological developments in these highly topical fields of research. Infor...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004